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ABSTRACT 

The ongoing Arecibo Legacy Fast ALFA (ALFALFA) survey is a wide-area, extragalactic Hl-line 
survey conducted at the Arecibo Observatory. Sources have so far been extracted over ~ 3000 deg 2 
of sky (40% of its final area), resulting in the largest Hl-selected sample to date. We measure the 
space density of Hi-bearing galaxies as a function of their observed velocity width (uncorrected for 
inclination) down to w = 20 km s _1 , a factor of 2 lower than the previous generation HI Parkes All-Sky 
Survey. We confirm previous results that indicate a substantial discrepancy between the observational 
distribution and the theoretical one expected in a cold dark matter (CDM) universe, at low widths. In 
particular, a comparison with synthetic galaxy samples populating state-of-the-art CDM simulations 
imply a factor of ~ 8 difference in the abundance of galaxies with w = 50 km s^ 1 (increasing to a 
factor of ~ 100 when extrapolated to the ALFALFA limit of w — 20 km s _1 ). We furthermore identify 
possible solutions, including a keV warm dark matter scenario and the fact that HI disks in low mass 
galaxies are usually not extended enough to probe the full amplitude of the galactic rotation curve. In 
this latter case, we can statistically infer the relationship between the measured HI rotational velocity 
of a galaxy and the mass of its host CDM halo. Observational verification of the presented relationship 
at low velocities would provide an important test of the validity of the established dark matter model. 
Subject headings: galaxies:statistics — dark matter — galaxies: dwarf — galaxies: luminosity function, 
mass function — radio lines: galaxies — surveys 



1. INTRODUCTION 

The current "standard" ACDM cosmological model 
has been extremely successful at reproducing the bulk 
of the observed prope rties of our universe on large scales 
(jKomatsu et al.l 120111 ) . However, given the current lack 
of a firm theoretical understanding of dark energy and 
the lack of a direct o r indirect detection of the dark 
matter ( DM) particle (lAhmed et alj I2009t JAngle et al.l 
I2008allb1 : Hbdo et al.ll2010al lbt lAdriani et al.l 12009ft . it is 
important to test in detail the astrophysical implications 
of the established cosmological paradigm. 

One of the most interesting consequences of assuming 
a cold dark matter (CDM) model is that substructure 
forms first on small scales, resulting in a present-day uni- 
verse populated by a multitude of low-mass halos. More 
formally, the mass distribution of DM halos is described 
by the DM mass function (MF), which is defined as the 
number density of halos as a fun ction of their virial mass ; 
it can be analytically p redicted ([Press fe Schechterlll974l : 
iSheth fc Torm cn 2002) that the MF displays a power- 
law behavior at low halo masses, n cx M a , with a rel- 
atively steep exponent of a ~ —1.9 in the standard 
ACDM context. This analytical expectation, confirmed 
to great accuracy by N-body simulations of structure for- 
mation dWarren et al.ll200a iBovlan-Kolchin et al.l 120091: 
iKlypin et al.l I2010D . leads to the prediction of a large 
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number of low mass halos for every Milky Way-sized 
(MW-sized) halo found in the present epoch. 

This firmly established theoretical result has led 
to a number of observational c hallenges, such as 
the "missing satel lites problem" (IKlypin et al.l 119991: 
Moore et al.fll999l: iDiemand et al.1 120071 : iStrigari et al.l 



20071: iSimon fc Gehal 120071). the "vo id phenomenon" 
(IPeeblesI 120011 : iTinker fc Conrovl 120091) . as well as the 
discrepancy between the sizes of mini- voids observed in 
the local universe and those prod uced in CDM sim- 
ulations (jTikhonov fc Klypinl I2009D . Additional con- 
cerns, again closely related to the distribution of halo 
masses predicted by CDM, are raised b y the flatness 
of th e galactic luminosity function (LF, iBlanton et al.l 
2005: IMontero-Dorta fc P radal 120091) . HI mass function 
(HIMF~ lMartin et al.l I2010D and ga l actic stellar mass 
function (GSMF. IBaldrv et al.1 120081: ILi fc White! 12009ft 
at their faint/low-mass end. These observational distri- 
butions display power-laws with a —1.3, much shal- 
lower than expected from the combination of a CDM uni- 
verse plus a naive linear relationship between halo mass 
and luminosity/baryonic mass. Despite their apparent 
diversity, all statements described above are just differ- 
ent aspects of the same fundamental issue: CDM struc- 
ture formation predicts large numbers of low mass halos, 
seemingly in contradiction with the relative paucity of 
visible low-mass galaxies. Hereafter, we refer to this dis- 
crepancy as the CDM overabundance problen^\. 

The main caveat regards the proper interpretation of 
these observational results. All phenomena mentioned 

3 This statement does not aim at including a second class of 
potential observational challenges to CDM, related to the density 
profile of halos in their central regions (known as the "cusp versus 
core" problem). 
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so far rely on the measurement of quantities indirectly 
related to the mass of the hosting DM halo (e.g. lumi- 
nosity or Hi/stellar mass) and, as a result, do not provide 
a direct means of comparing the MF expected for CDM 
with the MF realized in nature. In fact, a number of en- 
vironmental and feedback effects (see £15. 2|) are expected 
to affect the baryonic content of halos, with low mass 
ones being the most impacted. 

Ideally, one would need a large sample of galaxies with 
directly measured dynamical masses (e.g. through lens- 
ing or satellite kinematics), extending all the way to the 
low mass regime. Unfortunately, current datasets are 
restricted to relatively massive galaxies. The best prac- 
tical alternative would consist of a rich sample of resolved 
HI-interferometric rotation curves of galaxies, spanning 
a wide range in dynamical mass. Atomic hydrogen is 
usually the most spatially extended baryonic component 
in a galaxy, and therefore the best tracer of the rota- 
tion curve at large galactic radii. Such a sample could 
be used to determine the space density of galaxies as a 
function of their measured maximum rotational velocity, 
v rot . This observational statistic, which is referred to as 
the velocity function (VF) of galaxies, is more directly 
related to the halo dynamics than statistics based on lu- 
minosity /baryonic mass and has a largely different set 
of systematic issues. However, current datasets are very 
limited, mostly because HI interferometry is extremely 
time consuming (especially for low HI- mass targets). 

A more economical approach is to rely on wide-area, 
single-dish 21 cm surveys. Thanks to their intrinsic spec- 
troscopic nature, HI surveys automatically obtain the 
spectral Hi-line profile of every detected source. The 
velocity width of each detected galaxy, w, can thus be 
readily extracted, and the associated dataset can be used 
to measure the velocity width function (WF) of galaxies. 
One can furthermore apply inclination corrections to the 
measured widths in order to retrieve intrinsic rotational 
velocities (v rot ), and then estimate the galactic VF. Cor- 
recting for inclination requires however the use of exter- 
nal datasets, usually optical/NIR photometric surveys. 

So far, the most accurate WF and VF for late- type 
galaxies have been based on 4315 and 2646 Hl-selected 
galaxies respectivel y, detected by the HIPASS survey 
(|Zwaan et al.l [2010, hereafter ZwlO). Their measure- 
ment of the VF extends over the velocity range 30 km 
s _1 < v ro t < 300 km s _1 , and suggests a dramatic de- 
parture from the CDM expectation at low velocities 
(vrot ;$ 100 km s™ 1 ). Recent determinations of the 
VF for massive early-type galaxies (which are mostly 
absent from Hl-selected samples) have been obtained 
using the Sloan Digital Sky Survey (SDSS) and Two- 
de gree Field Galaxy Redshift Sur vey (2 dFGRS) datasets 
by IChoi et all ([20071 ) and[Chai (pOlOh . Both the late- 
type and early-type distributions are needed in order to 
derive the "total" galactic VF , since massive early-type 
galaxies are the dominant population at high velocities 



(250 km s 



Vrot 



< 450 km s 1 ) while late- types dom- 



inate the counts at lower velocities (v ro t ^ 250 km s _1 ). 

In this paper we present the Arecibo Legacy Fast 
ALFA (ALFALFA) measurement of the velocity width 
function of Hi-bearing galaxies. The decision not to cor- 
rect the measured widths for inclination is intentional, 
as the WF maintains all the advantages of the VF as 



a probe of the halo mass distribution, while featuring a 
number of observational advantages over the latter (see 
Sec. [3] for more details). The ALFALFA WF is based on 
10,744 Hl-selected galaxies (a more than twofold increase 
over previous datasets) and extends to widths as low as 
w = 20 km s _1 . 

This paper is organized as follows: in Section [2] we 
present the ALFALFA survey and the associated dataset; 
in Section |3] we discuss the observational advantages of 
the WF with respect to the inclination-corrected VF and 
present the ALFALFA measurement of the WF for HI- 
bearing galaxies; in Section [4] we address possible obser- 
vational biases on the determination of the ALFALFA 
WF; in Section [S] we compare the ALFALFA measure- 
ment with the expectations in a CDM universe, and de- 
scribe the possible solutions to the observed discrepancy 
at low widths; in Section [6] we derive the relation be- 
tween Vrot (measured observationally) and Vhaio (calcu- 
lated from N-body simulations), that would be needed 
to reconcile the velocity distributions of CDM halos and 
observed galaxies. We conclude with Section [7] by sum- 
marizing our results. 

Throughout this paper we use a Hubble constant of 
Hq = 70 km s _1 ; hro refers to the Hubble constant in 
units of 70 km s _1 Mpc~ x , while h refers to the Hubble 
constant in units of 100 km s~ x Mpc -1 . 

2. ALFALFA DATASET 

2.1. The survey 

The ongoing ALFALFA survey is a wide-area, blind 
21 cm emission-line survey that takes advantage of the 
increased survey speed offered by the 7-feed Arecibo L- 
band Feed Array (ALFA) receiver at the Arecibo Obser- 
vatory. The ALFALFA data are acquired in a minimally 
invasive drift-scan mode in two passes, ideally separated 
by several months in order to enable the discrimina- 
tion between narrow-band radio frequency interference 
(RFI) and small spectral width cosmic signals. When 
complete, the survey will have detected >30,000 galaxies 
over an area of ~ 7000 deg 2 of sky out to cz s» 18,000 
km s _1 . The ALFALFA survey is more s ensitive than 
the previous gener ation HIPASS survey (jMever et al.l 
120041 IZwaan et all l200l . with a 5er detection limit of 
0.72 Jy km s _1 for a source with a profile width of 200 
km s _1 as compared to a 5cr sens itivity of 5.6 Jy km s" 1 
for the same source in HIPASS (IGiovanelli et al.l 12005). 
In addition to greater sensitivity, ALFALFA has a finer 
velocity resolution (11.2 km s _1 versus 26.4 km s _1 for 
smoothed data) and better angular resolution (3.6' vs. 
13' FWHM), resulting in a more accurate identification 
of optical counterparts. 

2.2. The sample 

ALFALFA catalogs have so far been extracted 
(IGiovanelli et al.ll2007t iSaintonge et alj|2008t iKent et all 



2008; Stierwalt et all 



intongc et al. ^UU St IKe: 
2009; Mar tin et a!T 



200E 



Havnes et al.l 120111 ) for a total area of 2934 deg 2 



The current ALFALFA footprint consists of four distinct 
regions: two in the northern Galactic hemisphere, 
hereafter referred to as the Virgo direction region 
(VdR: 07^30™ < a < 16^30™, 4° < 5 < 16° and 
24° < 5 < 28°), and two in the southern Galactic 
hemisphere, hereafter referred to as the anti- Virgo 



ALFALFA Velocity Width Function 



3 



direction region (aVdR: 22' 1 < a < 03 h , 14° < 8 < 16° 
and 24° < 5 < 32°). From this primary dataset we only 
select extragalactic objects detected at high significance 
(S/N > 6.5, designated Code 1), and we further restrict 
ourselves to the redshift range cz ^ 15,000 km s~ x , 
beyond which interference from the nearby San Juan 
airport causes a significant drop of the ALFALFA 
detection efficiency. This final sample, corresponding 
to — 40% of the ALFALFA survey area (hereafter a. 40 
sample), contains a total of 11,086 galaxies. 

Figure [T] shows the spatial distribution of the a. 40 
sources in the Virgo and anti- Virgo directions respec- 
tively, and puts in evidence the complex large-scale struc- 
ture present in both volumes. Density fluctuations in the 
survey volume can be the dominant source of statistical 
uncertainty in surveys like ALFALFA, where the sam- 
ple size ensures small counting errors. Our statistical 
estimator, described in £13.11 in chosen to minimize this 
structure-induced bias. 

Figure [2] displays some statistical properties of the a. 40 
sample. Histograms (a) and (b) represent the distri- 
bution of heliocentric velocity, Vq, and of signal profile 
widt h, wsq, which are bo t h directly meas ured quanti- 
ties (|Giovanelli et alj|2007t lSaintongell2007f ). The signal 
profile width is measured at the 50% flux level of each 
of the two peaks of the typical double-horned HI pro- 
file (or at 50% of the single peak flux, for single-peaked 
profiles). The value of W50 reported in the ALFALFA 
catalogs is further corrected for instrumental broaden- 
ing. Histogram (c) displays the distribution of galaxy HI 
mass, Mhi, which is a distance dependent (and hence 
derived) quantity. Unlike previous HI surveys, we assign 
distances to nearby galaxies through a peculiar velocity 
flow model (Masters 2005) and use Hubble flow distances 
only for galaxies with cz > 6000 km s _1 (see 8 34.21 for a 
detailed discussion on the impact of distance uncertain- 
ties on ALFALFA results). 

Figure [3] displays the distribution of a. 40 sources in 
the velocity width (1U50) versus integrated-flux (Si n t) 
plane. As expected, the detection limit of the survey is 
a function of signal profile width and correctly scales as 
1/2 

Sint,Um ~ w 50 . Due to the large density of sources near 
the detection limit, we evaluate the completeness limit 
of the survey (red dashed line in Figure [3]) based on the 
actual data rather than on simulations using synthetic 
sources. 

3. THE VELOCITY WIDTH FUNCTION 

We obtain rest-frame galaxy velocity widths, w, by cor- 
recting the cataloged profile widths (1050) for Doppler 
broadening. It is customary to apply additional inclina- 
tion corrections to w, in order to recover intrinsic rota- 
tional velocities, v rot . However, since most extragalac- 
tic sources are unresolved at centimeter wavelengths, 
such corrections rely on external datasets (usually opti- 
cal or NIR photometric surveys) for the determination of 
galaxy inclinations. Here, we choose to make no further 
corrections to w and measure the velocity width function 
(WF) of galaxies, denoted by <j>(w). 

Even though the WF does not directly represent the 
distribution of any fundamental galaxy property, it is ob- 
servationally superior to the (inclination-corrected) VF. 
In particular, it is free of the restrictions and system- 



atics that arise from cross-matching HI and optical cat- 
alogs and correcting for galaxy inclination. For exam- 
ple, the HIPASS primary sample contains 4315 sources 
of whi ch only 2646 have unambiguous optical counter- 
parts (|Dovle et all 12005( 1 . Another 30% of the sources 
in this restricted subsample have low inclination values 
(i < 45°), and are thus subject to large inclination- 
correction errors. As a result, only sw 43% of the galaxies 
in the HIPASS primary sample were used for their de- 
termination of the VF. Furthermore, obtaining accurate 
estimates of the true orientation of irregularly-shaped 
dwarf galaxies is challenging, and the process may in- 
troduce biases in the measurement of the low-velocity 
end of the VF. 

Nonetheless, a measurement of the galactic WF would 
not be useful if it did not provide an accurate means of 
comparing the outcome of N-body simulations with the 
observed universe. Fortunately, it is relatively straight- 
forward to project a given theoretical rotational veloc- 
ity distribution and transform it into its corresponding 
width distribution (see Ej5.1[) . We conclude that the WF 
should be regarded as the prime observational distribu- 
tion for single-dish HI surveys, against which to compare 
theoretical expectations. 

3.1. The ALFALFA Velocity Width Function 

In Figured] we present the ALFALFA width function, 
based on 10,744 galaxies drawn from the a. 40 sample. 
For the calculation of the WF we restrict ourselves to 
a. 40 galaxies which are positioned in the portion of the 
flux- width plane where the ALFALFA survey is complete 
(i.e. above the red dashed line in Figure [3]) and have 
profile widths broader than u/50 > 18 km s" 1 . This cut 
results in the elimination of s» 330 galaxies from the cal- 
culation. An additional 13 very nearby sources are elim- 
inated, for which the flow model assigned distances are 
subject to large uncertainty. 

The WF is calculated in logarit hmic width bins, ac - 
cording to the El/V e ff method (;Z waan et al.l 120051) . 
The Hl/V e ff method is a non-parametric maximum 
likelihood method and, as such, it is insensitive to the 
presence of large-scale structure in the survey volume. 
As its name suggests, i t closely resem bles the tradi- 
tional E 1/V m ax method (|Schmidtl 119681 ) and consists of 
summing the number of detections in each width bin, 
weighted by the inverse of the "effective" volume avail- 
able to each source. More precisely, the space density of 
galaxies belonging to width bin k (k = 1, 2, N w ) is 

<pk = for all galaxies i in width bin k . (1) 

In the case of a spatially homogeneous survey volume, 
Veff,i would coincide with V ma x,i, the latter defined as 
the volume within which galaxy i could be placed and 
still be detectable by the survey. On the other hand, 
if the survey volume displays significant density varia- 
tions, V e ff^ takes into account the relative density of 
the volume available to galaxy i with respect to the mean 
density of the total survey volume. As with all density- 
independent estimators, the overall normalization is lost, 
and has to be calculated afterwards. The normalization 
is fixed by matching the integral of the distribution to the 
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aver age number density of galaxies in the survey volume 
(see lMartin et al.ll2010t Appendix B.l for more details). 

Due to its spectral resolution and sensitivity, AL- 
FALFA can push the low-width limit of the WF to 
w w 20 km s _1 , a factor of 2 lower than the HIPASS sur- 
vey. Over the full measured range (20 km s _1 < w < 800 
km s _1 ) the ALFALFA WF is very well described by a 
modified Schechter function of the fornQ 

^) = ^=ln(10)^(^) a e-(w)V (2) 
a log w \ w* / 

The least squares parameter^ are 0* = 0.011 ± 
0.002 h% Mpc- 3 dex-\ logw* = 2.58 ± 0.03, a = 
—0.85 ± 0.10 and /3 = 2.7 ± 0.3 (uncertainties are sta- 
tistical la errors due to Poisson errors on the individual 
bin values). Note, however, that the final sample con- 
tains 163 sources that lack a confidently identified optical 
counterpart. Some of these sources correspond to tidal 
debris from nearby interacting galaxies and may not be 
hosted by individual DM halos. Excluding these galaxies 
from the WF calculation leads to a somewhat shallower 
narrow-end slope of a — —0.68 ± 0.11. 

ALFALFA finds significantly more high-width galax- 
ies than HIPASS (a factor of ~ 3 at w w 400 km 
s _1 , growing to a factor of ~ 10 at w s» 800 km s -1 ), 
which is also evident from the marked difference in the 
value of the position of the "knee" of the WF for the 
two surveys (logw* = 2.58 ± 0.03 for ALFALFA ver- 
sus logw, = 2.21 ± 0.10 for HIPASS^, in disagreement 
at the > 3cr level). Despite the fact that the nominal 
HIPASS volume is a factor of ~ 5 larger than the a. 40 
volume, ALFALFA is able to find more high- width galax- 
ies thanks to its better sensitivity (see Figured]). The 
same effect can be seen in th e HIMFs published by the 
two surveys, with ALFALFA (jMartin et al.ll2010h finding 
a factor of a few mor e of the highest H i-mass galaxies 
compared to HIPASS (jZwaan et alj|2005j) . 

On the low-width end, ALFALFA finds a rising slope 
(a < 0) which is, however, by no means steep enough 
to match the CDM prediction (see Sec. [5]). Despite the 
vastly different value for the narrow-end slope reported 
by the two surveys (a = 0.10 ± 0.39 for HIPASS ver- 
sus a = -0.85 ± 0.10 for ALFALFA) the HIPASS and 
ALFALFA datapoints are consistent in the width range 
40 km s^ 1 < w < 200 km s -1 . The HIPASS a parameter 
is not well constrained, as their WF does not extend to 
low enough widths and suffers from considerable count- 
ing error in the low- width bins. 

4. BIASES 

4.1. Measurement errors on W50 

Measurement errors on W50 can shift galaxies among 
width bins, altering the bin counts and therefore the 

4 The parameterization here is equivalent to the parameteri- 
zation <j>(w) dw = </>» (w/w t ) a exp — (ui/ui*)' 3 ( / 9/r(a//3)) dw/w 
presented by other authors, except for the normalization factor 
/3/I>//3). 

5 The least squares parameters and their statistical errors were 
determined by the MPFITFUN procedure, written in the IDL pro- 
gramming language. 

6 No errors are reported for the published fit parameters to the 
HIPASS WF. In order to compare with ALFALFA, we derive errors 
by performing a least squares fit to the HIPASS WF datapoints. 



inferred space density. The W50 value for ALFALFA 
sources is subject to two separate sources of error: one 
is statistical in nature and present for all sources, while 
the other is systematic and concerns only a fraction of 
the a. 40 sample. The former is due to the distortion of 
the signal profile shape by noise; the latter results from 
the fact that the measurement of the spectral width of 
a signal relies on the accurate visual identification of its 
spectral boundaries, which is non-trivial for a number of 
sources (especially those found in the vicinity of RFI). 
The final width error reported in the ALFALFA cata- 
logs, Ai«5o, is the sum in quadrature of the random and 
systematic error terms described above. Owing to the 
fact that all a. 40 galaxies are detected with high signal 
to noise and have a clean spectral profile in the vast ma- 
jority of cases, the typical a. 40 width error is relatively 
small and its distribution well behaved. The median er- 
ror is Awso, median ~ 8 km s _1 and ^70% of the sources 
have a fractional error of AW50/W50 ^ 10%. 

In order to assess the effect of Aii^o on the WF, 
we create 50 mock galaxy samples by re-assigning ran- 
dom widths to every galaxy i in the primary ALFALFA 
dataset according to their individual measured width 
(wscm) and error (Att)5o,i). Each mock sample is subject 
to the same cuts as the a. 40 sample and a new realization 
of the WF is calculated ("lx" set). In order to illustrate 
the systematic trends introduced, we also perform an ad- 
ditional set of WF realizations with artificially inflated 
width errors (twice the reported ALFALFA width errors, 
"2x" set). 

The results are shown in Figure [5] overplotted to the 
original ALFALFA WF (datapoints and solid black line) 
are a modified Schechter fit to the mean WF correspond- 
ing to the lx (red solid line) and 2x (dashed red line) 
realizations. Width errors at the ALFALFA error levels 
seem to only slightly affect the high- width end of the WF. 
As evidenced by the 2x run, width errors generally lead 
to a rise of the high- width end, due to a net "diffusion" of 
galaxies from intermediate-width bins with large number 
counts to high-width bins with lower number counts. 

4.2. Distance Uncertainties 

Since velocity width is a distance-independent quan- 
tity, galaxy counts in width bins are not altered by dis- 
tance errors. However, the weights (1/V e ff.i) that each 
galaxy contributes to its bin depend on Hi-mass (see Eqn. 
Q] and dis cussion in £|3.1l). and therefore on the assumed 
distance. iMasters et al.l (|2004h have shown that ignoring 
the local peculiar velocity field can lead to biased esti- 
mates of galaxy statistical distributions, especially for 
surveys drawing a large fraction of their sample from the 
Virgo direction (VdR) . To avoid this bias ALFALFA uses 
redshift distances only for distant (cz > 6000 km s _1 ) 
galaxies and assigns distances to nearby g alaxies t hroug h 
a parametric flow model developed by IMasters! (2005) . 
The model includes two attractors (Virgo Cluster & 
Great Attractor), a dipole component (Local Group pe- 
culiar velocity), a quadrupole component (Local Group 
asymmetric expansion) and a random thermal residual 
of cri oca i ~ 160 km s _1 . Here we assume that most of the 
coherent motion of nearby galaxies is correctly described 
by the flow model, and no significant bias results from 
this systematic component of galaxy peculiar velocities. 
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Contrary to intuition however, even the random compo- 
nent ai OC ai can induce a systematic bias thro ugh the "Ed- 
dington effect" (see for example Figure 6 in lZwaan et al.l 
12001 . 

In order to asses the effect of ai oca i on the WF, we 
proceed as in £14. II and create 50 mock samples by adding 
gaussian noise on the cataloged distance of each a. 40 
galaxy. We calculate the WF corresponding to each sam- 
ple realization, and use the obtained average distribution 
to investigate the effect of distance uncertainties on the 
WF. We adopt the Masters (2005) value of ai oca i 160 
km s^ 1 , but we also perform simulations with double the 
fiducial dispersion (aiocai ~ 320 km s _1 ). 

The results are displayed graphically in FigureO where 
the datapoints and solid line correspond to the original 
ALFALFA WF, and the blue solid and dotted lines cor- 
respond respectively to the results of the oi oca i — 160 km 
s _1 and ai oca i = 320 km s _1 simulation sets. The largest 
effect is an overall increase in the amplitude of the WF 
at intermediate widths, which is probably due to the net 
transport of sources towards lower HI masses and there- 
fore larger values of 1/14//. Unlike in the case of the 
HIMF, the low end slope a does not seem to be affected 
in any systematic way. We conclude that, apart from a 
mild increase in amplitude at intermediate widths, the 
WF is relatively insensitive to distance uncertainties due 
to galaxy peculiar motions. 

4.3. Cosmic Variance 

The WF presented in Figure 0] aspires to represent the 
distribution in a cosmologically representative volume. 
The sensitivity of the ALFALFA survey allows ~ w* and 
broader galaxies to be detected throughout the full a. 40 
volume (V a Ao ~ 3.1 • 10 6 h^ Mpc 3 ), which ensures a cos- 
mologically fair sampling of the MW-sized galaxy pop- 
ulation. On the other hand, low-width galaxies tend to 
be faint systems that can only be detected in smaller 
volumes. As a result, the low- width bins of the WF 
are subject to increased uncertainty caused by the de- 
viation of the galaxy distribution from homogeneity on 
small scales, which is referred to as cosmic variance (see 
Figure [8] for a graphical illustration). 

In order to quantitatively asses the effects of cosmic 
variance on the ALFALFA WF, we jackknife resample 
the a. 40 survey volume, by splitting it into 14 parts 
equally spaced in R.A. Then, we reevaluate the WF ex- 
cluding each part in turn. The resulting scatter for each 
parameter, x, is given by 

^ = ^£(*-50 2 , N = U. (3) 

The scatter calculated by Eqn. [3] would be equal to the 
purely statistical error if the survey volume were homo- 
geneous, and so any excess noise results from the pres- 
ence of inhomogeneities. The method described above 
provides a measurement of cosmic variance on linear 
scales smaller than those probed by the full survey, and 
hence yields a conservative estimate of the true uncer- 
tainty (cosmic variance generally increases with decreas- 
ing scale). 

The full uncertainties on the fit parame- 
ters (including cosmic variance) are 4>* = 
0.011 ± 0.003 (0.002) h% Mpc^dex" 1 , 



logw* = 2.58 ± 0.04 (0.03), a = -0.85 ± 0.19 (0.10) 
and /3 = 2.7 ± 0.3 (0.3), where the term in parentheses 
represents the purely Poisson error reported in ^3.11 
Indeed, parameters u>* and /3, which dictate the shape of 
the WF at high widths, show a very modest increase in 
their uncertainty due to cosmic variance. On the other 
hand, the narrow end slope a is significantly affected, 
with cosmic variance contributing a large fraction of the 
full error. 

4.4. Beam confusion 

Beam confusion arises from the fact that the ALFA 
3.3' x 3.8' beam occasionally produces blends of small 
galactic groups at moderate distances, especially when 
individual galaxies are poorly separated in redshift space. 
The qualitative effect of such blends is to transform two 
or more independent sources into a single HI profile of 
larger W50 than each of its constituents. We do not at- 
tempt to quantify the effect of confusion bias, but we 
anticipate it to be more pronounced at the high-width 
end of the WF. This is because galaxies with w > 550 
km s are preferentially found at large distances, where 
beam confusion is more severe. It is worth noting that 
this bias, even though present, cannot account for the 
discrepancy between the ALFALFA and HIPASS WFs 
at high widths, since the latter suffers from more confu- 
sion due to its larger beam size (13' FWHM). 

5. COMPARISON WITH THEORY AND SIMULATIONS 

The velocity function of halos in a CDM universe scales 
as dn oc v ~ 4 dv, where v refers to the halo maximum 
rotational velocity. Even though a straightforward com- 
parison of the CDM VF with the ALFALFA WF is not 
possible, such a steep scaling suggests a substantial dis- 
crepancy between the theoretical and observed distribu- 
tions at low velocities. 

In order to make a meaningful comparison between the 
theoretical prediction and the ALFALFA measurement, 
it is necessary to take into account a number of important 
effects: 

i) DM halos exhibit significant substructure. High- 
resolution simulations have shown that several lo- 
cal density maxima (subhalos) develop within the 
virial radius of an underlying bound overdensity. As 
a result, massive halos (Mhaio ^ 10 13 M ) typi- 
cally host groups or clusters of galaxies rather than a 
single astronomical object. In general, a one-to-one 
correspondence between simulated halos and visible 
galaxies is not always possible. 

ii) The collapse of baryons to the central region of DM 
halos affects the galactic potential and leads to a 
modification of the true galactic rotation curves com- 
pared to the ones obtained in dissipationless DM 
simulations. 

iii) The detectability of a galaxy in an HI survey depends 
on its atomic hydrogen content. Galaxies that are 
deficient in HI may be underrepresented in an HI- 
selected sample. 

iv) The relationship between the maximum of the rota- 
tion curve of a galaxy and its HI velocity width is 
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non-trivial. Apart from the obvious dependence on 
disk inclination, the measured width depends on the 
spatial distribution of atomic hydrogen in the galac- 
tic potential. In particular, HI disks do not always 
extend far enough to sample the asymptotic outer 
part of the galactic rotation curve, and may under- 
estimate the mass of the host halo. 

It is, thus, necessary to populate the DM halos of an 
N-body simulation with modeled galaxies, and compare 
this virtual sample against the ALFALFA measurement. 
Modeling of the atomic hydrogen content of the synthetic 
galaxies is particularly desirable, because it greatly facil- 
itates the comparison between theoretical and observed 
d istributions. 

Obrcsch kow et al.1 (|2009l hereafter O09) have sim- 
ulated the Hl-line profi les for the galaxies in the 
iDe Lucia fc Bla izot ( 2003) semi-analytic catalog, created 
by post-processing t he Millennium N-body simulation 
(|Springel et al.ll2005t ). Figure U displays the WF (cyan 
solid line) resulting from projecting their modeled edge- 
on linewidths, assuming random galaxy inclinations. The 
O09 WF is in fairly good agreement with the ALFALFA 
measurement, but fails to display an exponential cutoff 
at high widths and therefore predicts too many high- 
width galaxies. This issue has been also pointed out in 
ZwlO, who argue that the disagreement is caused by the 
fact that the O09 catalog overestimates the HI masses of 
massive early- type galaxies. They found that restrict- 
ing themselves to synthetic galaxies classified as late- 
types (based on their bulge-to-total stellar mass ratios 
in the DeLucia catalog) produced a much better fit to 
their data. However, Figure [9] suggests that applying the 
"morphological" cut of ZwlO results in too few galaxies 
at intermediate widths (200 km s _1 < w < 600 km s _1 ). 

The red solid line in Figure [5] is the WF correspond- 
ing to an indirect observational estim ate of the ve- 
locity distribution of spiral galaxies by Gonzale z et al.1 
(2000). Their determination of the spiral galaxy VF 
was produced by combining the Southern Sky Redshift 
Survey -B-band LF for spirals in conjunction with the 
lYasuda et al.l (|1997f) Tully-Fisher parameters in the Bt- 
band. This indirect method, based on galaxy scaling 
relations, is reliable only for relatively massive spirals 
(vrot > 70 km s" 1 ) and suffers from numerous sources of 
uncertainty (e.g. scatter in the TF relation, uncertain- 
ties related to the correction of the LF for extinction, 
bandpass conversion uncertainties, etc.). 

5.1. The CDM overabundance problem 

CDM predictions start diverging from the observa- 
tional results at low widths, and so the behavior of the 
theoretical WF for w < 200 km s _1 is of great impor- 
tance. Unfortunately, the very interesting work of O09 is 
only reliable for w > 100 km s _1 due to the limitations 
in the mass resolution of the Millennium simulation. We 
employ instead two recent high-resolution CDM simula- 
tions, that lack however modeling of the HI component 
of their virtual galaxy samples. 

Figure fTOl compares the ALFALFA measurement with 
the WF of the gala xy population corr esponding to the 
Bolshoi simulation (iKlypin et alj 1201(f ). as modeled by 
iTruiillo-Gomez et all (|2010l hereafter TG10). Each Bol- 
shoi halo was assigned realistic stellar and cold gas 



masses, based on empirical relations. Subsequently, two 
models were considered, one where the gravitational po- 
tential of the baryons is simply superimposed on the DM 
potential (solid green line) and one where the DM halo 
adiabatically contracts in response to the presence of the 
baryons (dash-dotted green line). Note that TG10 de- 
fine v ro t as the value of the simulated rotation curve at a 
radius of 10 kpc. The authors argue that their modeling 
scheme and use of viokpc provide a good approximation 
of the measured velocity for galaxies with both flat and 
rising rotation curves. 

Also plotted i n Figure [T0| is the W F of simulated galax- 
ies based on the lZavala et al.1 ((20091 hereafter Za09) con- 
strained N-body simulation (blue solid line). Za09 per- 
form a modest volume (64 hT 1 Mpc on a side) but very 
high-resolution {vu m = 24 km s _1 ) constrained simula- 
tion, designed to reproduce the large-scale structure of 
the local universe. Virtual g alaxies are mode led accord- 
ing to the analytical results of IMo et ail (|1998l ) , assuming 
a disk-to- virial mass ratio of fdisk = Mdi.sk/ M V i r = 0.03 
independent of halo size. Lastly, the maximum ampli- 
tude of the rotation curve (v ro t,max) for each galaxy is 
calculated, by combining the disk and DM halo contri- 
butions. 

Since neither model considers the distribution of the 
velocity field tracer (i.e. HI) in simulated galaxies, we 
convert rotational velocities into HI velocity widths by 
assuming the relationship 

w = 2v ro t sini + w e ff . (4) 

Galaxies are assumed to be randomly oriented with re- 
spect to the line-of-sight (cosi is uniformly distributed 
in the [0,1] interval), while w e ft is a small "effective" 
term used to reproduce the broadening effect of turbu- 
lence and non-circular motions on HI linewidths. The 
use of eqn. U is only justified if the HI disk is extended 
enough to sample the value of v ro t adopted by the model 
under consideration (e.g. wiofepc for TG10 and v ro t,max 
for Za09). We adopt the value w e ff = 5 km s _1 for 
the broadening ternd, which is added linearly for galax- 
ies with v ro t > 50 km s" 1 and in quadrature for lower 
velocity galaxies. 

Figure [TO] puts in evidence the marked departure of the 
theoretical distributions from the ALFALFA measure- 
ment at w < 200 km s _1 , which becomes more dramatic 
with decreasing width. According to the TG10 WF, the 
difference is approximately a factor of ~ 4 at w = 100 
km s _1 , exhibiting an increasing trend. The Za09 WF0, 
implies a difference of a factor of ~ 8 at the lowest width 
where the simulation is complete (w ~ 50 km s _1 ), and 
displays a much steeper low-width slope than the AL- 
FALFA measurement. An extrapolation of the Za09 WF 
to the ALFALFA width limit (w = 20 km s _1 ), would 
result in a discrepancy of a factor of ~ 100. 

5.2. Is CDM viable? 

7 The value of w e ff = 5 km s -1 is derived empirically by 
Vcrheiicn & Sancisi (2001), based on a sample of 22 galaxies with 
flat or decreasing outer rotation curves. 

8 In order to account for the fact that the Za09 sample resides 
in an overdense volume (within a radius of 20 h Mpc from their 
simulated "Local Group" ) , we lower the normalization of their WF 
by a factor of 2, as suggested in their §4.3. 
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The ALFALFA measureme nt of the WF confi rms the 
results of the HIPASS survey (jZwaan et al.ll2010D . which 
obtained its WF at lower sensitivity and velocity reso- 
lution. This fact excludes the possibility that the CDM 
overabundance problem is an artifact of the limited per- 
formance characteristics of past blind 21 cm surveys. 
The reason for the observed discrepancy can be there- 
fore most likely attributed to one of the two following 
factors: 

1. The inaccuracy of standard CDM simulations, pre- 
sumably due to the inadequacy of the assumed DM 
model. 

2. The improper comparison of simulated halos with ob- 
served galaxies. This could be due either to 

a) the inadequate modeling of the baryonic counter- 
parts hosted by DM halos, which leads to wrong 
predictions for maximum rotational velocities, or 

b) the incorrect interpretation of inclination-corrected 
HI linewidths as maximum rotational velocities. 

In what follows, we will consider these possibilities in 
more detail and argue about their prospects as solutions 
of the CDM overabundance problem. 

Most large-scale simulations of cosmic structure con- 
form to the standard ACDM cosmological model. In par- 
ticular, they assume that all dark matter is cold (i.e. has 
negligible free-streaming length) , non self-interacting and 
stable (i.e. non-decaying). These properties are appro- 
priate for a universe where dark matter consists of stable 
weakly interacting massive particles (WIMPs). WIMPs 
are currently the favored DM particle candidate, and 
are expected to have masses in the GeV-TeV range and 
weak scale self-interaction cross-sections, justifying the 
DM attributes most commonly assumed in cosmological 
N-body simulations. 

However, the picture changes considerably if DM is 
composed of relatively light (~ keV) particles, in which 
case it is referred to as warm dark matter (WDM). Struc- 
ture on large scales would be the same as in a CDM uni- 
verse, but on small scales halo formation would be heav- 
ily suppressed due to the non-negligible free-streaming 
length of the light WDM particle. Za09 have considered 
this alternative scenario, and carried out a second run of 
their very high-resolution simulation assuming a 1 keV 
WDM particle. They subsequently populate their halos 
with synthetic galaxies, employing the same modeling 
scheme as in their CDM run ( ^5.1|) . The result is shown 
by the red solid line in Figure [TT1 superposed on the AL- 
FALFA WF (datapoints with errorbars and black solid 
line) and the result of their CDM run (blue solid line). 

Strikingly, the synthetic WF in the WDM case exhibits 
a shallow slope at the low-width end, in good agreement 
with the slope measured by ALFALFA. Such a shallow 
slope results from the suppressed production of low- mass 
halos in a WDM universe, which directly translates into 
a lower abundance of low- width visible galaxies. WDM 
could therefore provide a simple and elegant solution of 
the overabundance problem. 

Despite its appeal in this specific context, the general 
prospects of WDM also depend on its overall viability as 
the dominant constituent of non-baryonic matter in the 



universe. A number of theoretical microscopic models 
for WDM have b een proposed, most commonly involving 
steril e neutrinos (iDodelson fc W idrow 1994; Fuller et al.l 
120031 lAsaka et all 120051 : iKusenkd 120091 ) Constraints 
on the particle's mass can be placed by astrophysical 
and cosmological considerations. In particular, Lya for- 
est data places lower limits on the neutrino mass (a 
lighter particle generally results in suppression of power 
at larger scales), while X-ray observations can place up- 
per mass limits (radiative decay into X-ray photons gen- 
erally becomes more efficient at higher masses). The lim- 
its on the neutrino mass imposed by these observational 
constraints depend on the assumed neu t rino p roduction 
mechanism. [Abazaiian fc Koush iappas (2006) find that 
non-resonantly produced neutrinos are ruled out, using a 
compilati on of Lya forest and X -ray data (see references 
therein). iBoyarsky et al.1 (|2009T ) have considered sterile 
neutrino production in the context of the i/MSM (Min- 
imal Standard Model -I- 3 sterile neutrinos) and argue 
that neutrinos with m sn > 2 keV are viable. 

The second class of potential solutions attribute the 
disagreement between theory and observation to the pro- 
cess used to translate the output of simulations into ac- 
tual galaxies. In particular, a number of important ef- 
fects need to be taken into account (identified as items 
HlUvl in Section [5]) to ensure a successful comparison of 
simulated halos with observed galactic samples. 

Both theoretical works presented in £15.11 address is- 
sues and EH For example, Za09 set an explicit limit on 
the mass of halos hosting individual galaxies at M V i r = 
10 13 h^ 1 M Q . The influence of baryons on the shape 
of galactic rotation curves is also taken into account by 
both works, albeit using slightly different prescriptions 
and definitions of galaxy rotational velocity. Despite the 
use of numerous simplifying assumptions by TG10 (e.g. 
all baryons within lOkpc) and Za09 (e.g. fixed disk-to- 
virial mass ratio for all galaxies) their theoretical WFs 
are in fair agreement with the ALFALFA measurement 
at intermediate widths (200 km s _1 < w < 500 km s _1 ). 

The last two issues are related specifically to the atomic 
hydrogen content of galaxies, which is not modeled by 
either TG10 or Za09. Specifically, issue 1ml concerns the 
detectability of a galaxy in a 21cm survey. In principle, 
there exists the possibility that most of the low-mass 
halos predicted by CDM cosmology correspond to HI- 
devoid, dwarf spheroidal galaxies. In reality, a solution 
involving a multitude of isolated early-type d warf sys- 
tems seems rather unlikely. Direct observations (iGarnettl 
l2002l:lSwaters k Balcells1[2002l:lNoordermeer et al.ll2005H . 
as well as other empirical arguments, suggest that the 
Hi-to-stellar mass ratio grows with decreasing mass for 
galaxies in the field. HI surveys should thus have an 
advantage, rather than a disadvantage, at detecting the 
baryonic counterparts hosted by low-mass DM halos. In 
addition, optical surveys suggest that isolated early-type 
dwarfs in medium/low density envir onments are rela- 
tively rare ()Karachentsev et all 120041) . A second issue 
relates to the fact that satellite galaxies may be under- 
represented in the a. 40 sample, since they are generally 
redder (and have presumably lower gas frac tions) than 
centr al galaxies of the same luminosity (e.g. iFont et all 
2008). This bias could result in a < 30% underestimate 
of the abundance of low- width galaxi es by ALFALFA (e.g 
lYang et aLll2008t iKlypin et al.ll201fj|) . not nearly enough 
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to explain the observed discrepancies. 

Issue |iv] regards the size and detailed spatial distri- 
bution of the atomic hydrogen component in galaxies, 
which determines the way in which its rotation curve is 
converted into an HI velocity width. In particular, whi 
is a fair tracer of the maximum rotational velocity, only 
if the HI disk is extended enough to reach the flat (or 
decreasing) part of the galactic rotation curve. The use 
of Eqn. 0] in the derivation of the theoretical WFs im- 
plicitly assumes this situation to be true; observationally 
however , this is often t i mes n ot the case. For exam- 
ple, the ICatinella et al.l (|2006l ) set of template rotation 
curves, puts in evidence the fact that lower rotational ve- 
locity galaxies tend to have steeper outer rotation curves 
(see their Figure 1 fc 4). The dwarf galaxy sa mples of 
ISpekkens et~atl (|2005t l and lSwaters et al.l (|2009h . suggest 
that the effect becomes more dramatic at the lowest ve- 
locities (see Figure 3 & Figure 4 in the respective refer- 
ences) . 

This systematic trend for lower velocity galaxies to 
host less extended HI disks can be understood in terms 
of the expected baryon depletion of low-mass halos. Re- 
sults from N-body + hydrody namics simulations (e.g. 
iHoeft et all 120061 : iRicottil [2010 1 indicate that halos with 
mass below some critical value lose a significant fraction 
of their cosmic share of baryonic matter, due to envi- 
ronmental and internal feedback processes. In particu- 
lar, UV heating of the intergalactic medium (IGM) after 
reionization is believed to lead to substantial gas removal 
from low-mass halos (v rot < 20 - 30 km s" 1 , correspond- 
ing to M mr < 10 9 - 10 9 - 5 h- 1 M Q ). Internal feedback 
processes such as supernova winds may also be impor- 
tant, but their efficacy is strongly model dependent. 

The above considerations could lead to a solution of the 
overabundance problem that would not require a mod- 
ification of the extremely successful ACDM paradigm. 
In simple terms, the overabundance problem would be 
the result of the inability of HI to trace the maxi- 
mum halo rotational velocity of low-mass systems, which 
leads to a severe underestimate of their true mass. The 
same argument has been identified as a possible solution 
of the "mini- void size" problem by iTikhonov fc Klypinl 
(12009T). while a s imilar effect has been proposed by 
iPenarrubia et all (|2008| ) as a solution to the "missing 
satellites" problem. 

6. THE V aOT - V HALO RELATION IN A CDM UNIVERSE 

Assuming the CDM model to be correct, we can statis- 
tically infer the v rot - Vhaio relationship needed to repro- 
duce the observational galaxy VF. This can be done by 
abundance matching, a statistical procedure which as- 
sumes the existence of a one-to-one relationship between 
galaxy and halo circular velocities, v rot = f{vh a io)- It 
follows that the space density of halos with circular ve- 
locities larger than a given value, V, should be equal 
to the space density of galaxies with rotational veloci- 
ties larger than the value dictated by the relationship, 
n(v ha i > V) = n(v ro t > f(V)). 

Obtaining an observational velocity distribution from 
the ALFALFA measurement is not straightforward. 
Firstly, the ALFALFA measurement regards galaxy ve- 
locity widths (uncorrected for inclination) and not intrin- 
sic rotational velocities; secondly, the ALFALFA survey 
is biased against Hi-poor massive ellipticals that domi- 



nate the counts at high velocities. 

We address the first issue by searching for the veloc- 
ity distribution that best reproduces the ALFALFA WF, 
upon projection using Eqn. [4] We assume that the distri- 
bution follows a modified Schechter function of the form 

m = = ln(10) 0* (-) e-W , (5) 

a log v \ v * J 

and that it corresponds to the VF of Hi-rich, late-type 
galaxies. The set of parameters that provide the best 
match is identified visually, and corresponds to the val- 
ues 0* = 1.2 • 10~ 2 h% Mpc -3 , logv* = 2.32, a = -0.81 
and j3 = 3.1 (thick red dash-dotted line in Figure IT2"j) . 
In order to address the second issue (i.e. obtain a VF 
valid for all morphological types), we use the results of 
Chac (2010), who studied the velocity dispersion function 
(VDF) of early-type galaxies in the SDSS and 2dFGRS 
surveys. Velocity dispersions can be transformed into ro- 
tational velocities by assuming an isothermal mass pro- 
file, in which case v ro t = s/2a. Wc adopt the average of 
the 2dFGRS and SDSS velocity distributions as a rep- 
resentative VF for early-type galaxies, which we plot as 
the green dotted line in Figure [T2l 

We interpolate the two distributions using a single 
modified Schechter function with parameters </>„ = 8.7 ■ 
10" 3 /jf ]Vhxr 3 , logv* = 2.49, a = -0.81 and P = 3.35. 
The interpolated distribution (blue solid line in Figure IT21 
& Figure IT31 represents a composite galactic VF valid 
for all morphological types. Even though we do not for- 
mally measure errors for the derived distribution, we list 
below some important sources of uncertainty. Firstly, 
the statistical uncertainty on the parameters of the late- 
type VF should be at least on the order of the errors 
reported in £13.11 the parameters of the composite VF 
should be expected to carry larger errors, since the de- 
termination of the interpolating distribution is subjec- 
tive to some extent. More importantly though, there are 
a number of sources of uncertainty related to galactic 
physics. For example, the assumption of isothermality 
of e arly-type galaxies is not expected to hold in detail 
(e.g iDutton et al.l 12010ft , which would affect the high- 
velocity end of the composite VF. Moreover, the low 
velocity slope, a, depends partly on the value of w e ff 
employed in Eqn. 0J the value adopted here (w e tf = 5 
km s _1 ) has been empirically determined from a sampl e 
of relatively massive spirals (fVerheiien fc Sancisl feOOD. 
and does not have to be the same for galaxies populating 
the low-velocity end of the VF. Also, as mentioned in 
^5.2[ the inferred VF may be underestimating the true 
abundance of low- width galaxies by < 30 %, since AL- 
FALFA is likely to miss some fraction of the satellites of 
massive spiral galaxies. 

Next, we obtain the theoretical CD M VF from the Bol- 
shoi simulation^] (Klvp in~et alj |2010). In particular, we 
use the distribution of maximum halo rotational velocity, 
Vhaio, of all simulated halos (including subhalos) at the 
present epoch, which is shown as the black solid line in 
Figure H31 Note that the simulation is run for the total 
matter density of the universe (f2 m = Qdm + Qbar — 

9 The Bolshoi simulation is run for the set of cosmological pa- 
rameters h = 0.70, n m = 0.27, ag = 0.82, n = 0.95. 
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0.27), but both DM and baryons are treated as dissi- 
pationless components. Also note that the simulation 
is complete only down to Vhaio = 50 km s _1 , and a 
power-law extrapolation is used at lower velocities (which 
is however expect ed to hold, see for example §4.2 in 
iZavala et al1l2009l ). 

The red thick line in Figure [T?] represents the v rot 
- Vhaio relation obtained by matching the CDM and 
galactic velocity distributions (values listed in Table [1]). 
We have assumed that halos with Vhaio > 360 km s _1 
(M V i r > 10 13 hr 1 Mq) do not host individual galaxies 
but rather groups of galaxies, and are hence excluded 
from the matching process. The cyan shaded region cor- 
responds to different values for this mass cutoff, ranging 
from v ha io, m ax = 290 km s" 1 (M mr rj 5 • 10 12 h' 1 Mq, 
upper boundary^) to v halo, max = 

440 km s _1 (M vrr w 
2 ■ f0 13 h^ 1 Mq, lower boundary). The uncertainty in 
the value of Vhaio.max mentioned above is the only source 
of error considered explicitly here. There are, however, 
additional uncertainties involved in the determination of 
the presented relationship. For example, no scatter in the 
v rot — f(vhaio) relation was considered in the abundance 
matching process. Also, no corrections to v ro t for pres- 
sure support have been made in this work, even though 
gas thermal velocities in low mass galaxies can be com- 
parable with their rotational velocities. 

Figure [14] shows that v ro t follows an approximately lin- 
ear relationship with Vhaio only for intermediate-mass ha- 
los (120 km s _1 < Vhaio ^ 170 km s _1 ). In this range, 
v rot ~ 1.5 Vhaio, in fair agreement with the values es- 
timated for the MW and M31 from dynamical models 
(jKlvpin et al.l [2002, diamonds ) and from the ki nemat- 
ics of MW high velocity stars (iSmith et al.l I2007L trian- 
gle) and blue horizontal branch stars ( Xue et all 120081 
box). However, the v ro t/vhaio ratios obtained here are 
significantly larger than the average values inferred by 
iDutton et al.l (|2010l ) from a compilation of weak lensing 
and satellite kinematics datasets. Note though that their 
results are expressed in terms of a v op t - V200 relation, 
where v op t is defined as the measured rotational veloc- 
ity at 2.2 /-band disk scalelengths for late- type galax- 
ies and 1.65cr for early- type galaxies, and V200 refers to 
the virial velocity of the halo at an enclosed overden- 
sity of 200 times the critical density. In order to display 
their results in FigurcQT] (red & blue hatched regions) we 
have transformed virial velocities into halo maximum ro- 
tational velocities a ssuming average halo concentrations 
(jMaccio et al.ll2008D . 

The most important result of Figure [14] concerns the 
low halo velocity regime. In particular the relationship 
steepens continuously as we move to lower halo velocities, 
assuming a power-law behavior of the form v ro t oc i^ Q j 
at vi ia i i$ 50 km s _1 . As a result, the true mass of 
low- velocity halos is systematically underestimated when 
measured by the inclination-corrected HI linewidth of the 
hosted galaxy; the underestimate can reach a factor of 
~ 2.5 for Vhaio ~ 40 km s _1 . Testing the low- velocity 
end of the v ro t - Vhaio relation would require a sample 
of low-mass galaxies with directly measured dynamical 
masses, e.g. through weak lensing or satellite kinematics. 
However, some indirect observational support could come 
from a rich sample of HI interferometric maps of dwarf 
galaxies: a gradual transition from mostly flat to mostly 



rising rotation curves at v ro t ~ 110 km s~ 1 , would be 
required to explain the steepening of the relation at low 
velocities. Ultimately, observational verification of the 
presented relationship at low velocities would provide a 
check of the validity of the CDM model. 

7. CONCLUSIONS 

We have measured the velocity width function (WF) of 
Hi-bearing galaxies, based on a sample of 10,744 extra- 
galactic sources detected in ~40% of the final ALFALFA 
survey area. The ALFALFA measurement extends to 
widths (uncorrected for inclination) as low as w = 20 
km s _1 , and results in a robust measurement of the low- 
width logarithmic slope of a — —0.85 ± 0.19 (la statis- 
tical error including the effect of cosmic variance). This 
result suggests a significant incompatibility of the obser- 
vational distribution with the much steeper distribution 
expected in a CDM universe. 

We compare the ALFALFA result with the WFs of two 
modeled galaxy populat ions, one populating the Bo lshoi 
CDM simulation halos (|Truiillo-Gomez et al.l 120101 ) and 
the other populatin g the halos of the ve ry-high-resolution 
CDM simulation of Zav ala et al.l <|2009f ) . Indeed, the sim- 
ulated WFs start diverging from the ALFALFA mea- 
surement at widths w < 200 km s _1 . The difference 
in abundance is a factor of ~ 8 at w = 50 km s _1 (which 
corresponds to the resolution limit of the Za09 simula- 
tion), and implies a difference of a factor of ~ 100 when 
extrapolated to the ALFALFA low-width limit (w = 20 
km s _1 ). This discrepancy is closely related to a number 
of other observational challenges to CDM (e.g. "miss- 
ing satellites problem", "mini- void size problem", etc.), 
which we collectively refer to as the CDM overabundance 
problem. 

We further identify the two most promising solutions 
to the problem: the first involves the suppression of low- 
mass halo formation, which is best accomplished by as- 
suming a ^keV WDM particle; the second solution does 
not require a modification of the extremely successful 
CDM model, and relies on the fact that HI disks in dwarf 
galaxies are frequently not extended enough to probe the 
full amplitude of the galactic rotation curve. The lat- 
ter solution, supported by currently limited observational 
evidence, implies that galaxy rotational velocities derived 
from inclination-corrected HI linewidths (v rot ) systemat- 
ically underestimate the maximum rotational velocity of 
their host DM halo (vhaio), below v ro t ~ 110 km s" 1 . 

We furthermore use an abundance matching procedure 
to statistically infer the v ro t - Vhaio relationship needed to 
reconciliate the CDM and galactic velocity distributions. 
We find that for MW-sized galaxies v ro t ps 1.5vhaio, while 
at low velocities v ro t underestimates significantly the true 
maximum rotational velocity of the host halo. 

Determining the correct solution to the CDM over- 
abundance problem rests both on the general prospects 
of WDM as a viable dark matter model, as well as on 
observational verification of the v ro t - Vhaio relationship 
predicted for CDM. The latter goal could be best accom- 
plished through a rich sample of low-mass galaxies with 
directly measured dynamical masses. 
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TABLE 1 

The v ro t - v halo relationship in a CDM universe 



v halo [km s 1 ] 


v rot [km s 1 ] 

{Vhalo max — 360 km S 1 ) 


v ro t [km s 1 ] 

(vhalo,max — 290 km s 1 ) 


Vrot [km s 1 ] 

(Vhalo,max — 440 km S 1 ) 


40 


16 


16 


16 


45 


23 


23 


23 


50 


32 


32 


32 


55 


42 


42 


42 


60 


53 


53 


53 


70 


77 


78 


77 


80 


102 


103 


102 


90 


127 


127 


125 


100 


149 


150 


147 




loo 


i on 

±yu 


loO 


140 


219 


223 


218 


160 


247 


252 


244 


180 


270 


278 


267 


200 


291 


303 


286 


220 


310 


327 


303 


240 


328 


353 


318 


260 


345 


383 


333 


300 


383 




360 


340 


431 




387 


380 






416 


420 






449 



Swatcrs, R. A., Sancisi, R., van Albada, T. S., & van der Hulst, 

J. M. 2009, A&A, 493, 871 
Tikhonov, A. V., & Klypin, A. 2009, MNRAS, 395, 1915 
Tinker, J. L., & Conroy, C. 2009, ApJ, 691, 633 
Trujillo-Gomez, S., Klypin, A., Primack, J., & Romanowsky, 

A. J. 2010, ArXiv e-prints 
Verheijen, M. A. W., & Sancisi, R. 2001, A&A, 370, 765 
Warren, M. S., Abazajian, K., Holz, D. E., & Teodoro, L. 2006, 

ApJ, 646, 881 

Xuc, X. X., Rix, H. W., Zhao, G., Re Fiorentin, P., Naab, T., 
Steinmetz, M., van den Bosch, F. C, Beers, T. C, Lee, Y. S., 
Bell, E. F., Rockosi, C, Yanny, B., Newberg, H., Wilhelm, R., 
Kang, X., Smith, M. C, & Schneider, D. P. 2008, ApJ, 684, 
1143 

Yang, X., Mo, H. J., & van den Bosch, F. C. 2008, ApJ, 676, 248 
Yasuda, N., Fukugita, M., & Okamura, S. 1997, ApJS, 108, 417 
Zavala, J., Jing, Y. P., Faltcnbachcr, A., Yepes, G., Hoffman, Y., 

Gottlober, S., & Catinella, B. 2009, ApJ, 700, 1779 
Zwaan, M. A., Meyer, M. J., & Staveley-Smith, L. 2010, MNRAS, 

403, 1969 

Zwaan, M. A., Meyer, M. J., Staveley-Smith, L., & Webster, 
R. L. 2005, MNRAS, 359, L30 

Zwaan, M. A., Meyer, M. J., Webster, R. L., Staveley-Smith, L., 
Drinkwatcr, M. J., Barnes, D. G., Bhathal, R., de Blok, 
W. J. G., Disney, M. J., Ekcrs, R. D., Freeman, K. C, Garcia, 
D. A., Gibson, B. K., Harnett, J., Henning, P. A., Howlett, M., 
Jerjen, H., Kesteven, M. J., Kilborn, V. A., Knezek, P. M., 
Koribalski, B. S., Mader, S., Marquarding, M., Minchin, R. F., 
O'Brien, J., Oosterloo, T., Pierce, M. J., Price, R. M., Putman, 
M. E., Ryan- Weber, E., Ryder, S. D., Sadler, E. M., Stevens, 
J., Stewart, I. M., Stootman, F., Waugh, M., & Wright, A. E. 
2004, MNRAS, 350, 1210 



Zwaan, M. A., Staveley-Smith, L., Koribalski, B. S., Henning, 
P. A., Kilborn, V. A., Ryder, S. D., Barnes, D. G., Bhathal, R., 
Boyce, P. J., de Blok, W. J. G., Disney, M. J., Drinkwater, 
M. J., Ekers, R. D., Freeman, K. C, Gibson, B. K., Green, 
A. J., Haynes, R. F., Jerjen, H., Juraszek, S., Kesteven, M. J., 
Knezek, P. M., Kraan-Korteweg, R. C, Mader, S., 
Marquarding, M., Meyer, M., Minchin, R. F., Mould, J. R., 
O'Brien, J., Oosterloo, T., Price, R. M., Putman, M. E., 
Ryan- Weber, E., Sadler, E. M., Schroder, A., Stewart, I. M., 
Stootman, F., Warren, B., Waugh, M., Webster, R. L., & 
Wright, A. E. 2003, AJ, 125, 2842 



ALFALFA Velocity Width Function 



13 



12" 01 h 00 h 




Fig. 1. — Spatial distribution of 5868 sources in the Virgo direction region (VdR, left panel) and 2055 sources in the anti- Virgo direction 
region (aVdR. right panel). The Virgo Cluster and the "Great Wall" are the most conspicuous structures in the VdR (located at a distance 
of as 17 Mpc and w 100 Mpc respectively). In the aVdR, the Pisces-Perseus Supercluster (clearly visible at ?s 70 Mpc) as well as the void 
in its foreground dominate the large-scale structure. Dist ances are assigned through a combination of a flow model for the nearby Universe 
and Hubble distances for more distant galaxies (see i]2.2jl . 
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Fig. 2. — Properties of the a. 40 sample: Histogram (a) represents the distribution in heliocentric velocity (vq), while the red solid line 
represents the distribution expected in a homogeneous universe according to the selection function of the survey; the complex large-scale 
structure in the survey volume is apparent. Histogram (b) represents the distribution of velocity width («J5o); note the large number of 
very low-width galaxies (wso < 30 km s — 1 ) detected. Histogram (c) represents the distribution of galaxy HI mass (Mhi); again note the 
detections at very low HI mass (Mhi < 10 8 Mq). 
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Fig. 3. — Distribution of the a. 40 sources in the velocity width vs. integrated flux (1050 — Si n t) plane. The dashed red line is the survey 
completeness limit adopted in this work (Si nt ;; m /l Jy km s -1 ) = 0.06 (w$q/1 km s -1 ) 0,51 , which follows very closely the theoretically 
expected S int Hm oc w 1 / 2 . 
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Fig. 4. — The ALFALFA velocity width function (WF): datapoints represent the space density of Hi-bearing galaxies as a function 
of velocity width (corrected for Doppler and instrumental broadening, but uncorrected for inclination), as inferred from 10,744 galaxies 
detected by the 40% ALFALFA survey. The errors are lcr Poisson error s du e to galaxy counts in individual width bins. The red dotted 
line corresponds to a m odified Schechter fi t to the ALFALFA WF (see 33.11 . The green solid line represents the fit to the HIPASS WF 
based on 4315 galaxies (Zwaan ct al. 2010), over its measured range. 
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Fig. 5. — "velocity width Spanhauer" diagrams for ALFALFA (bottom) and HIPASS (top) on the same scale. The region above the 
horizontal orange line marks the range over which the two width functions disagree. Despite the fact that the nominal value of the HIPASS 
volume is a factor of ~5 larger than the a. 40 volume, ALFALFA detects more very broad profile galaxies. This is due to the limited 
sensitivity of HIPASS, which leads to a "thinning" of detections beyond fs: 100 Mpc and out to the survey boundary (area enclosed by 
vertical cyan lines). 
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Fig. 6. — Effect of width measurement errors on the width function: filled circles with errorbars and the black solid line represent the 
ALFALFA WF and the best-fitting modified Schechter function (same as Figure [2J. The red solid line corresponds to the distribution 
obtained by taking into account the ALFALFA measurement error on w§o. The WF remains mostly unchanged, except perhaps for a slight 
increase at the high width end. The red dashed line corresponds to artificially inflated width error s (twice the a. 40 errors) and is plotted 
in order to illustrate the general systematic trend introduced by width errors on the WF (see N4.1l for more details). 
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Fig. 7. — Effect of distance uncertainties on the width function: Filled circles with error bars and bl ack solid line as in Figure [4] The 
blue solid line corresponds to the result of adding a random velocity dispersion of cr; ocai = 160 km s _1 (Masters 2005) to the a. 40 galaxy 
distances. The dashed blue line corresponds to twice the fiducial velocity dispersion, <T ioca ; = 320 km s _1 . Note the relative immunity of 
the WF against distance uncertainties. The main effect appears to be an overall incr ease in amplitude, while (in contrast to the case of the 
HIMF) no clear trend for a steepening of the low-end slope seems to exist (see £14.21 for more details). 
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Fig. 8. — Cosmic variance: datapoints with error bars and black solid line as in Figure [4] The red and blue solid lines represent the 
WF in the Virgo direction Region (VdR: 07^30™ < a < 16 h 30 m , 4° < 8 < 16° and 24° < <5 < 28°) and the anti- Virgo direction Region 
(aVdR: 22 h < a < 03 h , 14° < 8 < 16° and 24° < 8 < 32°), respectively. The VdR is a locally overdense region while the aVdR is locally 
undcrdense, a fact that is reflected by the difference between the the two WFs at intermediate and low widths (see £|4.3| I. 
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Fig. 9. — Datapoints with errorbars and the black solid line represent the ALFALFA WF in the width range w > 100 km s _1 . The cyan 
solid line represents the Obreschkow et al. (2009, O09) WF, derived from projecting their distribution of modeled HI linewidths (w$q) for 
the synthetic galaxies in the semi-analytic catalog of Dc Lucia & Blaizot (2007). The cyan dash-dotted line represents the subsample of the 
O09 galaxies classified as "late-types" according to their bulge-to-stellar mass ratios in the DeLucia catalog. The red solid l i ne rep resents 
the projection of the indirect observational determination of the velocity function (VF) of spiral galaxies bv lGonzalez et al.l 12000) , Their 
VF was obtained by combining the observed luminosity function (LF) for spiral galaxies with the Tully-Fisher relationship. 
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Fig. 10. — The CDM overabundance problem: datapoints with errorbars and black solid line represe nt the measured ALFAL FA WF (same 
as in Figure|4]l. The green lines represent the WF of a sample of synth etic galaxies modeled by Truiillo-Gomez et al. (2010,, TG10), which 
populate the halos in the Bolshoi CDM simulation (Klypin et al. 2010). Two models were considered by TG10, one where the gravitational 
potential of baryons is simply superimposed on the DM potential (solid line) and one where the subsequent adiabatic contraction of the 
DM halo is taken into account (dash-dotted line). The b l ue sol id line represents the WF of a modeled galaxy population corresponding 
to the higher resolution CDM simulation of Zavala et al. (2009, Za09). Note that both theoretical distributions predict a steeply rising 
low-width end, in stark contrast with the observational result. The discrepan cy ac cording to the Za09 result is a factor of ~8 at w = 50 
km s — 1 , rising to a factor of ~ 100 when extrapolated to w = 20 km s _1 (see 95. 1^ . 
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Fig. 11. — Data points with error bars and black solid line represent the measured ALFALFA WF (s ame as in Figure ffl . The blue solid 
line repre sen ts the WF of a modeled galaxy population based on the high resolution CDM simulation of Zavala et al. (2009, Za09) (same as 
in Figure H"0) |, The red solid line represents the WF corresponding to a second run of the Za09 simulation assuming a 1 keV WDM particle 
(both simulations employ the same scheme to populate halos with synthetic galaxies). The WDM WF displays a shallow low- width slope 
due to the suppressed formation of structure at small scales, and is in much better agreement with the ALFALFA measurement. 
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Fig. 12. — The thick red dash-dotted line represents the velocity function of late-type galaxies (assumed to follow a modified Schechter 
distribution) that best reproduces the measured ALFALFA WF (light gray data points and solid lin e) upon proje ction (thin red dash-dotted 
line). The green dotted line represents the velocity function of early-type galaxies determined by Chac (2010) using SDSS and 2dFGRS 
data. The blue solid line is a modified Schechter interpolation of the two VFs which represents a velocity function valid for all morphological 
types. The modified Schechter parameters for the interpolated distribution are 4>» = 8.7 • 10" 3 ftf Mpc -3 , logu* = 2.49, a = -0.81 and 
j3 = 3.35 (see Section \E\ for more details). 
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Fig. 13. — The velocity function of halos (black line) and galaxies (blue line), expressed as a cumulative distribution. The former 
distribution corresponds to the number density of halos (including subhalos) in the Bolshoi CDM simulation, as a function of their 
maximum rotational velocity at the present epoch (vh a i a ). Note that the Bolshoi simulation is complete only down to v^ a i a = 50 km s — 1 , 
but a power-law extrapolation to lower velocities (black dashed line) is expected to hold. T he latter distribution represents the VF of all 
galaxy types, as a function of their observed rotational velocity (same as blue line in Figure Il2l see discussion is Section |6}. 
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Fig. 14. — v ro t - v^aia relation in a CDM universe: the red solid line corresponds to the relationship between the rotational velocity of 
galaxies measured observationally (v ro t) and the maximum rotational velocity of the corresponding CDM halo (v^alo)- The relation was 
obtained by the abundance matching of the velocity distribution of hal os i n the Bolshoi CDM simulation with the velocity distribution of 
galaxies inferred from ALFALFA and SDSS/2dFGRS data (see Figure [T2l fe 1131) , We have assumed that halos with v^ a i > 360 km s -1 
(M v i r > 10 13 Mq) do not host individual galaxies, but rather groups of galaxies. The cyan shaded area corresponds to different mass 
cutoffs, ranging from v^alo.max = 290 km s — 1 (upper boundary) to Vh a i 0tmax = 440 km s" 1 (lower boundary). The blue and red hatched 
areas correspond to the 2cr error regions for late- and early-type galaxies respectively, according to Dutton ct al. (2010). Their measurement 
was based on a compilation of weak lensing and satellite kinematics measurements of galaxy dynamical masses (see Section [6] for more 
details). The symbols correspond to the values estimated for the MW and M31 bas ed on dynamical models (Klvpin ct al. 2002, diamonds), 
and fo r the MW based on the kinematics of high velocity stars l lSmith et al.|[2007l . triangle with 2a errorbars) and blue horizontal branch 
stars (Xuc ct al. 2008, box with 2a errorbars). 



